Optimal transform for segmented parametric speech coding
نویسندگان
چکیده
In voice coding applications where there is no constraint on the encoding delay, such as store and forward message systems or voice storage, segment coding techniques can be used to achieve a reduction in data rate without compromising the level of distortion. For low data rate linear predictive coding schemes, increasing the encoding delay allows one to exploit any long term temporal stationarities on an interframe basis, thus reducing the transmission bandwidth or storage needs of the speech signal. Transform coding has previously been applied in low data rate speech coding to exploit both the interframe and the intraframe correlation [9][2]. This paper investigates the potential for optimising the transform for segmented parametric representation of speech.
منابع مشابه
Adaptive transformation for segmented parametric speech coding
In voice coding applications where there is no constraint on the encoding delay, segment coding techniques can be used to achieve a reduction in data rate. For low data rate linear predictive coding schemes, increasing the encoding delay allows one to exploit any long term temporal stationarities on an interframe basis, thus reducing the transmission bandwidth or storage needs of the speech sig...
متن کاملImprovements to the Switched Parametric & Transform Audio Coder
In this paper, we introduce improvements to previous sines + transients + noise audio modeling systems, including new sinusoidal trajectory selection and quantization procedures. In previous work [1], the audio is first segmented into transient and non-transient regions. The transient region is modeled using traditional transform coding techniques, while the non-transient regions are modeled us...
متن کاملEfficient Block Coding of Images Using Gaussian Mixture Models
An efficient method for block coding of speech was presented by Rao and Subramaniam in [7]. An adaptation of this method for the use in image coding is presented in this paper. The probability density function (PDF) of the image blocks is estimated and modelled as multivariate Gaussian mixtures using the k-means and Expectation-Maximisation (EM) algorithms. This parametric model is incorporated...
متن کاملError Protection and Concealment for HILN MPEG-4 Parametric Audio Coding
The HILN (Harmonic and Individual Lines plus Noise) MPEG-4 parametric audio coding tool allows efficient representation of general audio signals at very low bit rates. Therefore possible applications include transmission over IP or wireless channels which are both characterised by specific transmission error models. On the other hand, since parametric audio coding is a relatively new technique ...
متن کاملSpectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform
In this paper, the use of optimal KarhunenLoeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to encode efficiently the transform parameters after operating one or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Hu...
متن کامل